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Abstract 

Most of the work on card shuffling assumes that all the cards in a deck 
are distinct, and that in a well-shuffled deck all orderings need to be equally 
likely. We consider the case of decks with repeated cards and decks which 
are dealt into hands, as in Bridge and Poker. We derive asymptotic formulas 
for the randomness of the resulting games. Results include the influence of 
where a poker deck is cut, and the fact that switching from cyclic dealing 
to back-and-forth dealing will improve the randomness of a bridge deck by a 
factor of f3. 

1 Introduction 

Card shuffling has been used as an example of a mixing problem since the early 
part of the twentieth century. The fundamental question may be boiled down to: 
"How fast does repeated shuffling randomize the deck?" We will explore how to 
make mathematical sense of that question below. 

At the same time, in many card games, such as bridge, euchre, and straight 
poker, the players receive "hands" of cards which are dealt from the deck after it 
has been shuffled. A hand has no inherent order; in other words, the sequence in 
which the cards arrive in front of a player is unimportant. Thus the process of 
shuffling and dealing partitions the deck into sets of cards of predetermined sizes, 
and our goal should be to make the partition as unpredictable as possible. This 
might not require fully randomizing the deck. 

For instance, bridge is a game played with 4 players and a 52-card deck. Each 
player receives 13 cards. The usual method is to shuffle and then deal cyclically: 
first card to the player on the dealer's left, next card to the player on his left, and so 
on, clockwise around the table. Why deal this way, instead of just giving 13 cards 
off the top of the shuffled deck to the first player, then 13 to the next player, and so 
on? If the deck were perfectly randomized by the shuffling, the method of dealing 
would not matter. 

One reason not to simply "cut the deck into hands" is to prevent a dishonest 
dealer from stacking the deck through unfair shuffling. But one might also guess, 
correctly, that dealing cyclically augments the randomness of the game when the 
dealer is honest but has not shuffled the deck thoroughly. In this paper we address 
the questions: "How much difference does a dealing method make?" and "Is cyclic 
dealing the best method?" The goal is to show that the answers are "Quite a bit" 
and "No, we can do a lot better!" We present results for bridge and straight poker 
as examples. We also show how to estimate the randomness in a deck that has 
distinct cards and a deck with only two types of cards. 



2 A Brief History of Card Shuffling 



Henri Poincare devoted eight sections of his 1912 book Calcul des Probabilites 
§225-232] to card shuffling. He did not attempt to model any particular kind of 
shuffling, but showed that any shuffling method which meets certain mild criteria, 
if applied repeatedly to a deck, will eventually result in a well-mixed deck — that is, 
with enough shuffles the bias can be made arbitrarily small. About the same time 
Markov [15] was creating the more general theory of Markov chains, and he often 
used card shuffling as an example. The verdict of history seems to be that Markov 
justly deserves credit for the theory named after him, but that Poincare anticipated 
some of Markov's ideas in his work on card shuffling. Most subsequent work on 
shuffling has approached the problem as a Markov chain. 

In the 1950s Gilbert and Shannon [12] considered the problem of riffle shuffling. 
Riffle shuffling is the most common method used by card players to randomize a 
deck: the shuffler cuts the deck into two packets, then interleaves (riffles) them 
together in some fashion. Using the new science of information theory, Gilbert and 
Shannon began the inquiry into how fast riffle shuffling mixes a deck. In the 1980s 
Reeds jT7] and Aldous [1] added the assumption that all cut / riffle combinations are 
equally likely, and that has become known as the Gilbert-Shannon-Reeds or GSR 
model of card shuffling. 

In 1992, in the most celebrated paper on card shuffling to date [3], Bayer and 
Diaconis generalized the GSR shuffle to the a-shuffle. Let a be a positive integer, 
and cut a deck into a packets (one imagines an a-handed dealer in a futuristic 
casino), then riffle them together in some fashion. Assuming as before that all 
cut/riffle combinations are equally likely, performing a randomly selected a-shuffle 
followed by a randomly selected 6-shuffle turns out to be equivalent to performing 
a randomly selected ao-shuffle. In particular that means that a sequence of k GSR 
shuffles is equivalent to a single 2 fe -shuffle. Thus if we understand a-shuffles we 
implicitly understand repeated shuffles. 

Bayer and Diaconis found an explicit formula for the probability of a particular 
permutation tt after an a-shuffle, namely 



is the number of descents in the permutation 7r. 
Note. There are two ways to view a permutation: 

1. as a bijection 7r from {1,2, ...,n} to iteslf, so that 7r, when applied to a 
sequence of objects, moves the object in position i to position ir(i), and 

2. as an ordering a — ax, 02, . . . , o~ n of 1, 2, ... ,n. 

In this paper our decks will contain repeated cards, and our permutations will act 
to rearrange them. So we will consistently interpret permutations as maps. This 
may disorient readers who are used to the other viewpoint. 

In order to analyze the progress of a shuffler toward a well-mixed deck, we need a 
measure of how close the distribution after an a-shuffle is to the uniform distribution 




(1) 



where n is the size of the deck and 



des(7r) := # {i : Tr(i) > w(i + 1)} 
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Figure 1: The variation distance from uniform of a distinct 52-card deck after an 
a-shufflc. 



(all orderings equally likely). For this Bayer and Diaconis use variation distance 
from uniform, which may be defined as 

\K-U\\:=± Yl Kfr)-U(*)\, (2) 

7T£S n 

where U represents the uniform distribution on permutations, i.e., U(jr) — l/n\ for 
all 7r 6 S n . In terms of cards, the most biased game one could play with the shuffled 
deck is the two-player game in which player 1 wins whenever the ordering of the 
deck has probability higher than it should be under the uniform distribution. So 
1 1 P a — U 1 1 is the maximum bias toward any player in any game one might care to 
play with the shuffled deck. That is, the probability of any set of permutations is 
within ||P„ — U\\ of what it would be under the uniform distribution. 

Using the probability formula in (UJ) and the knowledge that the number of 
permutations in S n with d descents is the well-studied Eulerian number (™,) (see for 
example [13] . [4], [18]), Bayer and Diaconis are able to compute 

,, 1 v— I /n\ 1 fa + n — d— l\ 1 

very quickly. The result, for n = 52 cards and a between 1 and 1024, is graphed in 
Figured] The horizontal scale is logarithmic to represent the fact that a 2 fe -shuffle 
is the same as k GSR shuffles. 

The "waterfall" shape of the graph is typical of what Aldous [I] calls "rapidly 
mixing Markov chains" : negligible change after the first few GSR shuffles, followed 
by a fast approach to uniform ("the cutoff"), and eventually halving with each 
extra shuffle. Note that if we cast the problem in terms of a-shuffles, the ultimate 
exponential decay means the variation distance approaches K\/a for some constant 
K\ as a gets large. 

Many more papers on card shuffling and its applications have been published. 
See QJj] for a survey and [14] for an excellent exposition of the Bayer and Di- 
aconis results. There are other choices besides variation distance for measuring 
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randomness — see pp. 22-23], [3J §5.1], §5], [HJ §3], and §1] for alterna- 
tives. 



3 Repeated Cards and Dealing Methods 

The implicit assumption in Bayer and Diaconis and most other work on card shuf- 
fling is that all cards in the deck are distinct. (A recent exception is [2].) Here we 
consider the alternative: suppose a deck D is a sequence of cards, each of which 
has a value taken from some fixed set of values, and we allow two cards to have the 
same value. 

This complicates the problem in the following ways: 

• Decks (ordered sequences of cards) and transformations between decks can no 
longer be identified with permutations. Instead for each pair of decks there is 
a set of permutations which transform the first into the second (and a different 
set which goes the other way). The transformation sets are easy to describe, 
but it is difficult to find their probability after shuffling. 

• The initial order of a deck, and not just its composition, affects how fast the 
distribution approaches uniform. 

If D 1 is some rearrangement of D, let T(D, D') be the set of permutations 
which, when applied to D, result in D' . (T(D,D r ) is a left coset of the stabilizer 
stab(D) = T{D, D) and a right coset of stab(D') = T(D', D').) Thus the probabil- 
ity of obtaining D' as a result of a-shuffling D is 



\(D^D'):= £ W = ^b d 



a + n — d — 1 



a * — ' \ n 

ttGT(D,D') d 

where is the number of permutations in T(D, D') with d descents. In this context 
we call D the source deck, D' the target deck, and 

T>(D,D';x):= ^ x dcsM = ^ b d x d (3) 

TreT(D.D') d 

the descent polynomial of D and D' , (The reader will kindly forgive the many 
different uses of the letter "D" in this paper. D will always be a source deck, D' a 
target deck, d an integer which represents a number of descents, and T> the descent 
polynomial.) 

If we give D an a-shuffle, then the distance between the resulting distribution 
on decks and the uniform distribution is 



D'eO{D) 



(4) 



where O(D) is the set of reorderings of D (i.e., D's orbit when acted on by S n ) and 
N is the size of O(D). We will refer to this as the fixed source case. 

On the other hand, suppose we are playing bridge. Here all the cards have 
distinct values; let e(l), e(2), . . . , e(52) be the initial order of the cards. The dealer 
a-shufflcs the deck and then deals it out to the four players, who are referred to as 
North, East, South, and West. Since each player receives 13 cards, we can describe 
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a method of dealing cards as a sequence of 13 N's, 13 E's, 13 S's, and 13 W's. For 
instance, 

D' := NESWNESW • • - NESW = (NESW) 13 

represents cyclic dealing, where the top card goes to North, the second to East, 
etc. We can describe a particular partition of the deck into hands by a string of the 
same type. For example, 

D := NSEENNWEWSSWESWNNNEESSSSSESWWNNSENWSEWSWWWEENEWNNNWE (5) 

refers to the partition in which the North player gets cards e(l), e(5), e(6), . . . , e(50), 
the East player gets cards e(3), e(4), e(8), . . . , e(52), and so on. So D, like D', is a 
function from {1,2,..., 52} to {N, E, S, W}, and D(i) is the player who receives card 
e(i). 

In order for the North player to receive the cards assigned by D, the shuffle must 
move those cards to the positions occupied by N's in D', and likewise for the other 
players. In other words, if we think of D and D' as decks with cards of value N, E, 
S, and W, the shuffle will produce the desired partition if and only if it takes D to 
D' . Thus we already have a notation for the distribution over partitions that an 
a-shuffic followed by the dealing method D' produces, and the variation distance 
between that distribution and uniform is 
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DeO(D> 



\(D -> D') - 1 



(6) 



where N is now the number of possible partitions. This is the fixed target case. 
Note that despite the strong similarity between (|4|) and ([6]), fixed source and fixed 
target are dual problems, not identical, because the transition probability P a (-D — > 
D') is not symmetric. 



4 Calculating Transition Probabilities 

Unfortunately, computing ||P a — U\\ precisely for realistically sized decks is pro- 
hibitively complicated in both the fixed source and fixed target cases. Both cases 
require knowledge of the transition probabilities ¥ a (D —> D') in order to calculate 
variation distance from uniform. Conger and Viswanath [7] showed that calcula- 
tion of transition probabilities is computationally equivalent to calculation of the 
coefficients of the descent polynomial T>(D, D'; x). Unfortunately, the same authors 
subsequently found j8] that for certain decks the calculation belongs to a class of 
counting problems called #P, and is in fact #P-complete. As with NP-complete 
problems, it is generally believed that #P-complete problems do not admit efficient 
solutions. 

Barring a method for calculating variation distance without first computing 
transition probabilities, the question shifts to approximation. Theorem [T] (below) 
will allow us to approximate transition probabilities when a is large, given some 
simple information about the decks. To that end, here are some definitions: 

If u and v are card values, we say that D has a u-v digraph at i if D(i) = u 
and D(i + 1) = v. We say that D has a u-v pair at if i < j, D(i) = u, 

and D(J) = v. The distinction between digraphs and pairs is akin to that between 
descents and inversions in a permutation. Let 

W(D, u, v) := # {u-v digraphs in D} — # {v-u digraphs in D} , (7) 
Z(D, u, v) := # {u-v pairs in D} — # {v-u pairs in D} . (8) 



5 



For example, the deck D = ABAAABABB has 3 A-B digraphs and 2 B-A digraphs, 
so W(D, A,B) = 3-2 = 1. D has 15 A-B pairs and 5 B-A pairs, so Z(D, A,B) = 
15 — 5 = 10. Note that both W and Z are antisymmetric in u and v. W(D, u, v) — 
-W(D,v,u) and Z(D,u,v) = -Z{D,v,u). 

In the theorem below we assume that D is a deck of n cards. For each card 
value v, n v is the number of cards in D with that value. For convenience we assume 
an implicit order on the values; the particular order chosen is arbitrary and does 
not affect the result. 

Theorem 1. Suppose D is as above, and D' is a reordering of D. Then 

¥ a (D -+£>') = -! + c^D^D 1 )^ 1 + 0(a- 2 ), 
where N is the number of reorderings of D and 

d (D, D') = —Y W(D > V)Z{D '> % V) ■ (9) 
2N^ 

We begin with a plausibility argument in favor of formula (0). An a-shufflc 
begins with an a-cut, which arranges the cards into a piles. If a is very large, 
most piles will have size or 1. If no pile has two or more cards, then no ordering 
information will survive the shuffle. The next most likely case is that the a-cut 
produces just one pile with two cards u and v. Should this happen, these cards 
must come from a u-v digraph in D. When the cards are reassembled, these two 
must remain in order, becoming a u-v pair in D' . Thus the main source of bias 
in P a (D — > D') is the relationship between digraphs in D and pairs in D' . This 
suggests the formula W(D, u, v)Z(D', u, v). 

Proof of Theorem [7J Let S be the size of the stabilizer of D in S n - Since 
T(D, D') is a coset of the stabilizer, its size is S also, and since there are N such 
cosets, NS = n\. Let bd be the number of permutations in T(D, D') with d descents; 
so Ed b d = S. Then 



/ „ 1 v ^ . / n + a — d — 1 

a d 

1 ^ (a-d)(a-d + l)---(a-d + n-l) 



a * — ' n 

a 



n\ t-r 1 \ a I \ a 



\ d d 



I - <l \ ( n - 1 - d s 



nd + 0(a~ 2 ) 



n(n — 1) 



So the constant term is 1/N, which is what we expect: it means that if we shuffle D 
for long enough, the probability of obtaining any particular deck approaches 1/N, 
i.e., the distribution on decks approaches uniform. The coefficient of a -1 is 

V b d (n -l-2d) = —E(n - 1 - 2 des(Tr)), 
2NS ^ V J 2N y V >h 

d 

where 7r is a permutation chosen uniformly from T(D, D') and E represents expecta- 
tion. Recall that descents are positions i with 1 < i < n—1 such that tt(i) > n(i + l). 
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The other positions are ascents, and if we denote their number by asc(7r) then we 
must have des(7r) + asc(-7r) = n — 1. So the number we are after is 



n-1 

^E(asc(7r) - des(vr)) = JL £ E^(vr), (10) 



where 

1 if 7r(i) < 7r(i + 1), 



Wi(7r) 



-1 if 7r(i) > 7r(i + 1). 



Let the first card in D have value u and the second have value v. Suppose first that 
u = v. 7r must take those two cards to two positions in D' which have value u, but 
otherwise it has no reason to prefer any particular destinations; thus it is equally 
likely that tt(1) < tt(2) as that tt(1) > tt(2). So if u = v then Eu)x(n) = 0. 

On the other hand, suppose u ^ v. Then ir picks a destination for the top card 
uniformly from among those j for which D'(j) — u, and likewise 7r(2) is chosen 
uniformly and independently from D' _1 (f;). We will have u)i(tt) — 1 if the first 
choice is less than the second, that is, if tt maps {1,2} to a u-v pair in D' . Wi(tt) 
will be — 1 if 7r maps {1,2} to a v-u pair in D' . Each pair is equally likely, so 

^ # {u-v pairs in £)'} — # {w-u pairs in D'} Z(D' , u, v) 

# {u-v pairs in D'} + # {v-u pairs in D'} n u n v 

All the other Wj are calculated in the same way, and if we group them according to 
the values of the digraphs in D, we get the desired result for c\. □ 

5 The Fixed Source Case 

Now that we can approximate the transition probability between two decks, we can 
approximate variation distances. In the case of a fixed source deck D we have 



\K-u\\ = \ £ 



2 

D'eO(D) 



a\ I N 



Kia" 1 +0(a 2 ), 



where 



{D):=\ J2 M D > D ')\- ( n ) 



Kl = Ki x ^ , } 



D'eO(D) 



5.1 All-Distinct Decks 



For example, consider shuffling a deck D containing n distinct cards. Without loss 
of generality we may assume that D — 1,2, . . . ,n, i.e., that the value of the card in 
position i is i. Each reordering of D is produced by a unique permutation, so 



2 

Since each card appears once and all the digraphs in D are of the form (i, i + 1) we 
can reduce © to 

n-1 

cx(D,ttD) = ^Z(^,i,i+l). 

i=l 
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Figure 2: The variation distance from uniform, and first-order approximation, of a 
distinct 52-card deck after an a-shuffle. The actual variation distance is graphed in 
black, and the first order approximation Ki/a, with Ki = 44.05710497, is graphed 
in gray. 

Z(ttD, i, i+ 1) is 1 if tt has an ascent at i and — 1 if 7r has a descent at i, so we have 
K ^ = m £ Nc(7T)-des(7r)| = ^W^\|n-l-2d|. (12) 

The Eulerian numbers can be calculated using a simple recurrence [13] , so (1121) is 
all we need to find the long-term behavior of variation distance from uniform after 
shuffling a deck of n distinct cards. For instance, when n — 52 we find that n\ is 

146020943891326775423340146124729913263177343486982212261189487693 
3314356310443124530393681659122442758682178888925184000000000000 ' 

which is approximately 44.06. (In general, K\ is about ri\J (n + 1) /24-7T for a deck of 
n distinct cards [BJ.) So after giving a deck of 52 distinct cards an a-shuffie, where 
a is large, the variation distance from uniform will be approximately 44.06/a. We 
can compare that with the exact results Bayer and Diaconis obtained for the same 
deck, to see how big a has to be to make the approximation a good one. Figure [5] 
shows the result. 

5.2 Decks with Two Card Types 

Let Gk be a complete directed graph on k vertices, with loops at each vertex. A 
deck with cards of k different types may be thought of as a walk on Gk , where the 
starting position is the top card and the ending position is the bottom card. Each 
edge in the walk represents a digraph in the deck. 
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Consider a deck D with only two types of cards, which we will unimaginatively 
label 1 and 2. If the deck begins with 1 and ends with 2, then the corresponding 
walk must have traversed the 1-2 edge once more than the 2-1 edge, and there- 
fore W(D, 1,2) = 1. Likewise beginning with a 2 and ending with a 1 makes 
W(D, 1, 2) = — 1. But beginning and ending with the same type of card means that 
both edges were traversed the same number of times, so in that case W(D, 1, 2) is 
0. With only two types of cards © reduces to 

ji 

d(A D') = -W(D, 1, 2)Z(D', 1, 2), 

which vanishes if W(D, 1,2) is 0, for all D 1 . So if the top and bottom cards of 
the unshufflcd deck are the same, n\ will be 0, meaning that in the long run the 
variation distance decreases at least as fast as some multiple of a~ 2 . So we have 
the surprising result that a shuffler of this type of deck can greatly speed the deck's 
approach to randomness by making sure that the top and bottom cards are the 
same before he begins shuffling. 

We will concentrate on the fixed target case for the rest of this paper, but the 
interested reader may consult [B] for more results in the fixed source case. 



6 The Fixed Target Case (Dealing into Hands) 

If we fix the target deck (i.e., dealing method) D', we have 



■a-U\\ = 



DeO(D') 



\(D -> D') - 



N 



= Kia 1 +0(a 2 ), 



where 



DeO(D') 



This sum seems intractably large, so it is useful to have an alternative algorithm 
for the calculation of Tti (£)'). First notice that 



{D') = \ J2 MA^OI 

DeO(D') 



DeO(D') 



n s -^W(D,u,v)Z(D / ,u,v) 



n 

4N 
n 

AN 



E 



E 

DeO(D') 

E \ e ^\ 

DeO(D f ) 



Z(D',D(i),D(i + l)) 



n D{i)nD{i+l) 



where 



0(D) := E 



Z(D',D(j),D(i + l)) 



For our cases, there are far fewer possible values for 9(D) than there are decks 
D G O(D'), so we want to reason about the distribution of values of 9(D) as D 
ranges over O(D'). To prepare a recursion, we will need this distribution to depend 
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also on the last card of D, and we will need to consider decks with fewer cards than 
D. Since O(D') depends on the number of cards of each type in D', but not on 
their order, let m — (mi, m-2, . . . , rrik) be an integer vector, representing a collection 
of mi < ni cards labeled 1, < 112 cards labeled 2, etc. If v £ {1, . . . , k} is a card 
value, we write D H u to mean that the last card of D is i>, and H to to mean 
that D ends with the digraph uv. We also write: 



0(m) 



C(l mi 2 m2 •••fc mfc ) if all mi > 0, 
otherwise, 



grn.v(t) ■= ^ t 



6(D) 



DeO(m) 
D-\v 

Then gn, v (t) will record the distribution of interest. 

Let e v be the standard basis vector with a 1 in coordinate v and Os in all other 
coordinates. We derive the recurrence by considering the second-to-last card: 



.97 



.,„(*)= £ ^)=x: e * e(D) as) 



DGO(m) u DeO(nT) 

D-\v D-\uv 



u DeO(m~e v ) 
D-\u 



This enables us to find Ki(D ) by recursively computing gm,v{t) for each card 
value v and for each integer vector m with (0, . . . , 0) < m < (ni, . . . , rifc). There 
are £;J}(ni + 1) generating functions to compute, which is feasible for the cases we 
discuss. 



6.1 Straight Poker 



Straight poker is a game in which players receive 5 cards each from a deck of 52 
distinct cards. The remaining cards are unused. Dealing is traditionally cyclic, so 
with 4 players the normal deal sequence is 



D' 



poker 



(1234) 5 5 



where 1, 2, 3, and 4 represent the players and 5 is the "hand" of unused cards. The 
reader may check that Z(D' o)sel , u, v) is the row u, column v entry of 



^(-Cpokcr) — 



which allows us to use (H 



V 



5 


•5 


•5 


160 


-5 


5 


5 


160 


-5 -5 





5 


160 


-5 -5 


-5 





160 


160 -160 


-160 


-160 






K l(^poker) 



to calculate 

1041539930128654272599 
123600572196960202344 



8.427. 



The usual procedure in poker, however, is for the dealer to shuffle the cards and 
then allow the player to his left to cut them — that is, move some number of cards 
from the top to the bottom — before dealing. Fulman [IT] showed that with a deck of 
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Figure 3: The effect on K\ of cutting a poker deck at position to. 

distinct cards, a shuffle followed by a random cut was no more effective a randomizer 
than the shuffle alone. 

When the deck will be dealt into hands, however, choosing a particular cut can 
enhance the randomness. Moving k cards has the same effect as making the deal 
sequence a k D' pokcl , where a is the cycle (1,2, ... , 52). The problem is small enough 
that we can simply try all possible cuts and report Tti for each; the result is in 
Figure [3J The best place to cut the deck is after the 16th card, making Z(u, 5) = 
for each player u. We find 



523485619699747366033 
126685078454994859800 



4.132. 



Thus a good cut can effectively halve the value of K\ , meaning it is worth one extra 
GSR shuffle. The method of the next section can be used to improve the situation 
further. 



6.2 Bridge 

As described in Section |3l a method of dealing bridge can be identified with a target 
deck D' e 0(N 13 E 13 S 13 W 13 ). We have n = 52 and n v — 13 for each card value v, so 



IZN 



E 



DeO(D') 



J2HD',D(i),D(i + l)) 



(14) 



Suppose we deal a game of brige by "cutting the deck into hands." That is, the 
top 13 cards go to North, the next 13 to East, etc. Call this "ordered dealing." 
Symbolically, 

D' md = N 13 E 13 S 13 W 13 . 
There are 169 N-E pairs in D' old and no E-N pairs, so Z(D old ,N,E) — 169. Likewise 
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for the other card values, so 



Z(D' old ) = 169 



( ° 

-1 

-1 

v -i 



1 1 

o 1 

-1 o 

-i -i 



1 \ 
i 

i 



where we give the values the implicit ordering N < E < S < W and interpret the 
entries in the matrix accordingly. Using (|13|) we can compute 



93574839271687495932003418573 
3352796110343049552452340000 



27.91. 



Thus in the long run only slightly less shuffling is required for a bridge deck that 
will be cut into hands than for a deck in which all orderings are distinct. 
Of course the way that most bridge players deal is cyclically: 



D' = D' = (NESW) 



13 



In that case the reader can check that 



13 



( 

-1 

-1 

v -i 



1 \ 

1 
1 

o / 



which is to say, Z(D' cyc , u, v) = j^Z(D' 0] . d , u, v) for all pairs of card types. It follows 
then from (fTU) that 



^lPcyc) = ^lPord) = 



7198064559360576610154109121 
3352796110343049552452340000 



2.147. 



So dealing cyclically works 13 times as well as simply cutting the deck into hands. 
That is to say, in the long run, switching from ordered to cyclic dealing is worth an 
extra log 2 (13) ~ 3.7 2-shufHes. 

The reason cyclic dealing is so much better than cutting into hands is that 
it makes Z(D,u,v) small by better balancing the number of u-v pairs with the 
number of v-u pairs, for all u and v. If a is considerably larger than the deck size 
n, it is likely that when the deck is partitioned into a packets, all of the packets 
will either be empty or contain exactly one card. If such is the case, they will be 
riffled together in an arbitrary order, and the deck will be perfectly randomized. In 
fact, that suggests an alternate way to estimate how many shuffles are needed to 
adequately randomize a deck, an idea due to Reeds [17] and reported in Diaconis 
[9]. Simply calculate the likelihood that each card is in a different packet after the 
cut (this is the celebrated "birthday problem" of combinatorics) , and pick a large 
enough that the probability is high. 

The new idea here is that a dealing method can help ameliorate the bias in the 
case where a is still reasonably large, but some packet contains two cards. Imagine 
that the cards are initially arranged from "best" to "worst" before shuffling. Then a 
packet with two cards contains a "good" card atop a "worse" one, and after riffling 
the two cards will remain in the same order, though other cards may come between 
them. This is the source of the bias which remains even when a is large. If the 
two cards are dealt to players u and v, then we would like it to be approximately 
equally likely that u gets the good card and v the bad as the other way around. 
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Figure 4: Three styles of bridge dealing, represented by lattice paths. The grid 
on the left represents ordered dealing, the center is cyclic dealing, and the one on 
the right is back-and-forth dealing. Each grid shows the sequence of N and E cards 
as north and east line segments respectively. The size of the Young shape to the 
northwest of the path is the number of E-N pairs in the target deck, and the size of 
the complementary shape is the number of N-E pairs. 



Thus we would like there to be about as many u-v pairs in the dealing method as 
there are v-u pairs. 

Consider just the North and East players in bridge. We can describe a dealing 
method D' as it applies to those players by drawing a north-east lattice path starting 
from the lower-left corner of a 13 x 13 grid. That is, traverse D 1 and draw a 
north segment whenever an N is encountered and an east segment whenever an E is 
encountered. (The traditional names of players are very fortuitous for this exercise.) 
Every square to the southeast of the path has a northward segment to its left and 
an eastward segment above it, so it corresponds to a N-E pair in D' . Likewise the 
squares in the Young shape to the northwest of the path represent E-N pairs. 

Figure H] shows the paths and shapes for ordered and cyclic dealing on the left 
and in the middle. Cyclic dealing is much better than ordered dealing because the 
path stays near the diagonal of the grid, so about half the squares are on either 
side. 

However, it always stays to one side of the diagonal, and thus it is easy to see 
that we can do better! The path on the right side of Figure H] corresponds to 

D' h{ = (NESWWSEN) 6 NESW, 

and by crossing the diagonal it balances the two sides as well as can be done, making 
Z(D' hf ,N 7 E) = 1. Likewise for the other pairs of players, so we have 



Z(D' t 



bf) 



/Oil 1\ 
-10 11 
-1-1 1 

V -l -l -l o J 



and therefore 



1 7198064559360576610154109121 
«i( bf)-^ K H eye) - 43 586 349434459 6 44 18 i 88 0420000 ~ 

The dealing method described by D' b{ may be called "back-and-forth" dealing, since 
the dealer hands out cards once around the table clockwise, then once counterclock- 
wise, then clockwise, counterclockwise, etc. We have shown that in the long run 
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Deck 


Method 


a = 16 


32 


64 


128 


256 


512 


1024 


oz uisimci 
123 - • ■ (52) 


Exact 
44.0571a" 1 


l.UUUU 
2.7536 


u.yzo ( 
1.3768 


U.Dloo 
0.6884 


n *3 Q A 1 

0.3442 


0.1721 


U.Uoo4 
0.0860 


u.U4zy 
0.0430 


Ordered Bridge 
N 13 E 13 S 13 W 13 


Monte Carlo 
27.9095a" 1 


0.9902 
1.7443 


0.7477 
0.8722 


0.4230 
0.4361 


0.2183 
0.2180 


0.1104 
0.1090 


0.0550 
0.0545 


0.0274 
0.0273 


Cyclic Bridge 
(NESW) 13 


Monte Carlo 
2.1469a" 1 


0.2349 
0.1342 


0.0735 
0.0671 


0.0346 
0.0335 


0.0169 
0.0168 


0.0084 
0.0084 


0.0042 
0.0042 


0.0021 
0.0021 


Back-Forth Bridge 
(NESWWSEN) 6 (NESW) 


Monte Carlo 
0.1651a" 1 


0.3118 
0.0103 


0.0260 
0.0052 


0.0073 
0.0026 


0.0022 
0.0013 


0.0008 
0.0006 


0.0003 
0.0003 


0.0002 
0.0002 



Table 1: Variation distances from uniform after an a-shuffle for a deck of distinct 
cards and 3 methods of dealing bridge; a = 16 means 4 riffle suffies, a — 32 is 5 
riffle shuffles, etc. 



(i.e., for large a), back-and- forth dealing is 13 times as effective as cyclic dealing. 
Or, switching from cyclic to back-and-forth dealing is worth log 2 (13) « 3.7 extra 
GSR shuffles. 

We could apply the same strategy to the poker deck that was cut after the 16th 
card; the combination of that cut and back-and-forth dealing produces a Ki which 
is 1/5 what it was with cyclic dealing. 

7 Two Big Questions 

1. How big does a have to be for the first-order approximation to be 
a good one? 

By manipulating absolute value signs, one can show [6] that the error between 
variation distance from uniform and the first order estimate Kia^ 1 or Kia" 1 
is bounded above by 

J_/a + n-d-l\l 1 (2zl- d ) . 

a n \ n J n\ a(n - 1)1 \ 2 / 

This bound, which depends only on the size of the deck (n) and the size of the 
shuffle (a), could undoubtedly be improved. Figure [2] shows that in the case 
of a deck of 52 distinct cards, the first-order estimate becomes quite good at 
about the point of the cutoff. 

So, how many times does a bridge player need to shuffle in order to see the 
promised benefits of switching dealing methods? Monte Carlo estimates (see 
[6] for a full explanation of methods and confidences) give strong evidence that 
back-and-forth dealing beats cyclic dealing after 5 or more shuffles. Table [1] 
shows the results. 

2. How good is the GSR model? 

The GSR model represents idealized riffle shuffling, in the sense that every 
possible shuffle is equally likely. Human shufflers vary in their skill and "neat- 
ness," sometimes clumping cards together too much, sometimes not enough. 
The question is whether the conclusions drawn here from the GSR model will 
still hold when the model is replaced with the way real people shuffle cards. 
This is a topic for future work. 
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